Class Proportion Estimation with Application to Multiclass Anomaly Rejection
نویسندگان
چکیده
This work addresses two classification problems that fall under the heading of domain adaptation, wherein the distributions of training and testing examples differ. The first problem studied is that of class proportion estimation, which is the problem of estimating the class proportions in an unlabeled testing data set given labeled examples of each class. Compared to previous work on this problem, our approach has the novel feature that it does not require labeled training data from one of the classes. This property allows us to address the second domain adaptation problem, namely, multiclass anomaly rejection. Here, the goal is to design a classifier that has the option of assigning a “reject” label, indicating that the instance did not arise from a class present in the training data. We establish consistent learning strategies for both of these domain adaptation problems, which to our knowledge are the first of their kind. We also implement the class proportion estimation technique and demonstrate its performance on several benchmark data sets.
منابع مشابه
Gene-Based Multiclass Cancer Diagnosis with Class-Selective Rejections
Supervised learning of microarray data is receiving much attention in recent years. Multiclass cancer diagnosis, based on selected gene profiles, are used as adjunct of clinical diagnosis. However, supervised diagnosis may hinder patient care, add expense or confound a result. To avoid this misleading, a multiclass cancer diagnosis with class-selective rejection is proposed. It rejects some pat...
متن کاملGaussian Mixture Models for multiclass problems with performance constraints
This paper proposes a method using labelled data to learn a decision rule for multiclass problems with class-selective rejection and performance constraints. The method is based on class-conditional density estimations obtained by using the Gaussian Mixture Models (GMM). The rule is thus determined by plugging these estimations in the statistical hypothesis framework and solving an optimization...
متن کاملImbalanced Multiclass Data Classification Using Ant Colony Optimization Algorithm
Class imbalance problems have drawn increasing interest lately because of its classification trouble caused by imbalanced class deliveries and poor prediction performance for minority class. This problem is particularly common in preparation and can be detected in various disciplines including fraud detection, anomaly detection, oil spillage detection, medical diagnosis, facial recognition. Man...
متن کاملPolysulfone Ultrafiltration Membranes Modified with Carbon-Coated Alumina Supported NiTiO2 Nanoparticles for Water Treatment: Synthesis, Characterization and Application
This paper reports on the synthesis and characterisation of polysulfone (PSf) ultrafltration (UF) membranes modifed with carbon coated alumina Ni-doped titanium dioxide (CCA/Ni-TiO2) nanoparticles. The syntheses of the membranes was carried out using the phase inversion process. The ...
متن کاملAnti - Profiles for Anomaly Classification and Regression
Title of dissertation: ANTI-PROFILES FOR ANOMALY CLASSIFICATION AND REGRESSION Wikum Dinalankara, Doctor of Philosophy, 2015 Dissertation directed by: Professor Hèctor Corrada-Bravo Department of Computer Science Anomaly detection is a classical problem in Statistical Learning with widereaching applications in security, networks, genomics and others. In this work, we formulate the anomaly class...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014